Project Proposal: How Verb-Particle Constructions Affect Translation Quality in English and German

نویسنده

  • Nina Schottmüller
چکیده

This project deals with verb-particle constructions (VPCs) in the context of statistical machine translation, with the purpose of finding out if and to what degree they affect translation quality in English and German. Its main goal is to identify which types of VPCs cause translation errors and to find possible solutions to this problem. VPCs are a type of multiword expressions (MWEs), meaning they are lexical items consisting of multiple simplex words that display different kinds of idiosyncrasies [4]. Even though German and English share many common features and both languages exhibit a variety of particle verbs, their behaviour is not always similar. First of all, there are grammatical differences. For example, in English the particle is always separate from the verb root (e.g. eat up), while in German, it can be separated, but does not necessarily have to be (e.g. aufessen: er isst den Kuchen auf ). Secondly, both languages have different VPCs in general. Even though there are cases where VPCs are formed in the same manner (e.g. eat up and aufessen), there are also cases where they are formed with different particles (e.g. step down and zurücktreten), or different verbs (e.g. turn down and ablehnen). While some VPCs translate to other VPCs, there are also particle verbs that translate to simplex verbs or to several alternatives that convey the same meaning, as the following example shows (taken from Bing Translator). Mach die Tür auf! → Open the door! (1)

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation

Support-verb constructions (i.e., multiword expressions combining a semantically light verb with a predicative noun) are problematic for standard statistical machine translation systems, because SMT systems cannot distinguish between literal and idiomatic uses of the verb. We work on the German to English translation direction, for which the identification of support-verb constructions is chall...

متن کامل

Issues in Translating Verb-Particle Constructions from German to English

In this paper, we investigate difficulties in translating verb-particle constructions from German to English. We analyse the structure of German VPCs and compare them to VPCs in English. In order to find out if and to what degree the presence of VPCs causes problems for statistical machine translation systems, we collected a set of 59 verb pairs, each consisting of a German VPC and a synonymous...

متن کامل

Discontinuous Verb Phrases in Parsing and Machine Translation of English and German

In this paper, we focus on the verb-particle (V-Prt) split construction in English and German and its difficulty for parsing and Machine Translation (MT). For German, we use an existing test suite of V-Prt split constructions, while for English, we build a new and comparable test suite from raw data. These two data sets are then used to perform an analysis of errors in dependency parsing, word-...

متن کامل

Context Resolution of Verb Particle Constructions for English to Hindi Translation

Verb Particle Constructions (VPCs) are flexible in nature and hence quite complex and challenging to handle. As a consequence, VPCs generate a lot of interest for NLP community. Despite their prevalence in English they are not handled very well, and hence often result in poor quality of translation. In this paper we investigate VPCs for English to Hindi translation. An English VPC can have diff...

متن کامل

Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate

This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish. We classify support verb constructions by means of their syntactic structure and se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013